Search CORE

156 research outputs found

On the importance of being finished

Author: McCombie W Richard
Palmer Lance E
Publication venue: BioMed Central
Publication date: 01/01/2002
Field of study

The publication of an increasing number of draft genome sequences presents problems that will only be resolved by improved search tools and by complete finishing of the sequences - and their deposition in publicly accessible databases

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

A comparative analysis of exome capture

Author: Grabill Ian
Iossifov Ivan
Kramer Melissa
McCombie W Richard
Parla Jennifer S
Spector Mona S
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

ABSTRACT: BACKGROUND: Human exome resequencing using commercial target capture kits has been and is being used for sequencing large numbers of individuals to search for variants associated with various human diseases. We rigorously evaluated the capabilities of two solution exome capture kits. These analyses help clarify the strengths and limitations of those data as well as systematically identify variables that should be considered in the use of those data. RESULTS: Each exome kit performed well at capturing the targets they were designed to capture, which mainly corresponds to the consensus coding sequences (CCDS) annotations of the human genome. In addition, based on their respective targets, each capture kit coupled with high coverage Illumina sequencing produced highly accurate nucleotide calls. However, other databases, such as the Reference Sequence collection (RefSeq), define the exome more broadly, and so not surprisingly, the exome kits did not capture these additional regions. CONCLUSIONS: Commercial exome capture kits provide a very efficient way to sequence select areas of the genome at very high accuracy. Here we provide the data to help guide critical analyses of sequencing data derived from these products

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

PubMed Central

EST analysis in Ginkgo biloba: an assessment of conserved developmental regulators and gymnosperm specific genes

Author: Brenner Eric D
Coruzzi Gloria M
Douglas Andrew W
Katari Manpreet S
McCombie WR
Moss Walter N
Rudd Stephen A
Runko Suzan J
Stellari Giulia M
Stevenson Dennis W
Twigg Richard W
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: Ginkgo biloba L. is the only surviving member of one of the oldest living seed plant groups with medicinal, spiritual and horticultural importance worldwide. As an evolutionary relic, it displays many characters found in the early, extinct seed plants and extant cycads. To establish a molecular base to understand the evolution of seeds and pollen, we created a cDNA library and EST dataset from the reproductive structures of male (microsporangiate), female (megasporangiate), and vegetative organs (leaves) of Ginkgo biloba. RESULTS: RNA from newly emerged male and female reproductive organs and immature leaves was used to create three distinct cDNA libraries from which 6,434 ESTs were generated. These 6,434 ESTs from Ginkgo biloba were clustered into 3,830 unigenes. A comparison of our Ginkgo unigene set against the fully annotated genomes of rice and Arabidopsis, and all available ESTs in Genbank revealed that 256 Ginkgo unigenes match only genes among the gymnosperms and non-seed plants – many with multiple matches to genes in non-angiosperm plants. Conversely, another group of unigenes in Gingko had highly significant homology to transcription factors in angiosperms involved in development, including MADS box genes as well as post-transcriptional regulators. Several of the conserved developmental genes found in Ginkgo had top BLAST homology to cycad genes. We also note here the presence of ESTs in G. biloba similar to genes that to date have only been found in gymnosperms and an additional 22 Ginkgo genes common only to genes from cycads. CONCLUSION: Our analysis of an EST dataset from G. biloba revealed genes potentially unique to gymnosperms. Many of these genes showed homology to fully sequenced clones from our cycad EST dataset found in common only with gymnosperms. Other Ginkgo ESTs are similar to developmental regulators in higher plants. This work sets the stage for future studies on Ginkgo to better understand seed and pollen evolution, and to resolve the ambiguous phylogenetic relationship of G. biloba among the gymnosperms

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

PubMed Central

Syntenic Relationships between Medicago truncatula

Author: Dong-Jin Kim
Douglas R. Cook
Helge Küester
Hong-Kyu Choi
Hongyan Zhu
Hui-Mei Peng
Jong-Min Baek
Leland C. Ellis
W. Richard McCombie
Publication venue: 'American Society of Plant Biologists (ASPB)'
Publication date
Field of study

Crossref

Phylogenomic analysis of transcriptome data elucidates co-occurrence of a paleopolyploid event and the origin of bimodal karyotypes in Agavoideae (Asparagaceae)

Author: Ayyampalayam Saravanaraj
Chase Mark W.
de Pamphilis Claude W.
dePamphilis Claude W.
Leebens-Mack Jim
McCombie W. Richard
McKain Michael R.
Pires J. Chris
Wickett Norman
Zhang Yeting
Publication venue: 'Botanical Society of America'
Publication date: 01/02/2012
Field of study

Premise of the study: The stability of the bimodal karyotype found in Agave and closely related species has long interested botanists. The origin of the bimodal karyotype has been attributed to allopolyploidy, but this hypothesis has not been tested. Next-generation transcriptome sequence data were used to test whether a paleopolyploid event occurred on the same branch of the Agavoideae phylogenetic tree as the origin of the Yucca-Agave bimodal karyotype. Methods: Illumina RNA-seq data were generated for phylogenetically strategic species in Agavoideae. Paleopolyploidy was inferred in analyses of frequency plots for synonymous substitutions per synonymous site (K-s) between Hosta, Agave, and Chlorophytum paralogous and orthologous gene pairs. Phylogenies of gene families including paralogous genes for these species and outgroup species were estimated to place inferred paleopolyploid events on a species tree. Key results: K-s frequency plots suggested paleopolyploid events in the history of the genera Agave, Hosta, and Chlorophytum. Phylogenetic analyses of gene families estimated from transcriptome data revealed two polyploid events: one predating the last common ancestor of Agave and Hosta and one within the lineage leading to Chlorophytum. Conclusions: We found that polyploidy and the origin of the Yucca-Agave bimodal karyotype co-occur on the same lineage consistent with the hypothesis that the bimodal karyotype is a consequence of allopolyploidy. We discuss this and alternative mechanisms for the formation of the Yucca-Agave bimodal karyotype. More generally, we illustrate how the use of next-generation sequencing technology is a cost-efficient means for assessing genome evolution in nonmodel species

Cold Spring Harbor Laboratory Institutional Repository

ZENODO

Dryad Digital Repository (Duke University)

Electronic Archiving System

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Genome and transcriptome of the regeneration-competent flatworm, Macrostomum lignano.

Author: Battistoni Giorgia
Delás M Joaquina
El Demerdash Osama
Falciatori Ilaria
Gurtowski James
Hannon Gregory J
Ladurner Peter
McCombie W Richard
Ramos Olivia Mendivil
Schatz Michael
Schärer Lukas
Smith Andrew D
Vizoso Dita B
Wasik Kaja
Zhou Xin
Publication venue: Proc Natl Acad Sci U S A
Publication date: 21/09/2015
Field of study

The free-living flatworm, Macrostomum lignano has an impressive regenerative capacity. Following injury, it can regenerate almost an entirely new organism because of the presence of an abundant somatic stem cell population, the neoblasts. This set of unique properties makes many flatworms attractive organisms for studying the evolution of pathways involved in tissue self-renewal, cell-fate specification, and regeneration. The use of these organisms as models, however, is hampered by the lack of a well-assembled and annotated genome sequences, fundamental to modern genetic and molecular studies. Here we report the genomic sequence of M. lignano and an accompanying characterization of its transcriptome. The genome structure of M. lignano is remarkably complex, with ∼75% of its sequence being comprised of simple repeats and transposon sequences. This has made high-quality assembly from Illumina reads alone impossible (N50=222 bp). We therefore generated 130× coverage by long sequencing reads from the Pacific Biosciences platform to create a substantially improved assembly with an N50 of 64 Kbp. We complemented the reference genome with an assembled and annotated transcriptome, and used both of these datasets in combination to probe gene-expression patterns during regeneration, examining pathways important to stem cell function.This work is supported by National Institutes of Health Grants R37 GM062534 (to G.J.H.) and R01-HG006677 (to M.S.); National Science Foundation Grant DBI-1350041 (to M.S.); and a Swiss National Science Foundation Grant 31003A-143732 (to L.S.). This work was performed with assistance from Cold Spring Harbor Laboratory Shared Resources, which are funded, in part, by Cancer Center Support Grant 5P30CA045508.This is the final version of the article. It first appeared from PNAS via http://dx.doi.org/10.1073/pnas.151671811

Cold Spring Harbor Laboratory Institutional Repository

edoc

PubMed Central

Apollo (Cambridge)

DNA sequence level analyses reveal potential phenotypic modifiers in a large family with psychiatric disorders

Author: Arnau-Soler Aleix
Blackwood Douglas H. R.
Davies Gail
Deary Ian J.
Duff Barbara
Evans Kathryn L
Ghiban Elena
Hayward Caroline
Kramer Melissa
Lawrie Stephen M.
Lihm Jayon
McCarthy Shane
McCombie W. Richard
McIntosh Andrew M.
Morris Stewart
Porteous David J
Ryan Niamh M.
Thomson Pippa A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/06/2018
Field of study

Psychiatric disorders are a group of genetically related diseases with highly polygenic architectures. Genome-wide association analyses have made substantial progress towards understanding the genetic architecture of these disorders. More recently, exome- and whole-genome sequencing of cases and families have identified rare, high penetrant variants that provide direct functional insight. There remains, however, a gap in the heritability explained by these complementary approaches. To understand how multiple genetic variants combine to modify both severity and penetrance of a highly penetrant variant, we sequenced 48 whole genomes from a family with a high loading of psychiatric disorder linked to a balanced chromosomal translocation. The (1;11)(q42;q14.3) translocation directly disrupts three genes: DISC1, DISC2, DISC1FP and has been linked to multiple brain imaging and neurocognitive outcomes in the family. Using DNA sequence-level linkage analysis, functional annotation and population-based association, we identified common and rare variants in GRM5 (minor allele frequency (MAF) > 0.05), PDE4D (MAF > 0.2) and CNTN5 (MAF < 0.01) that may help explain the individual differences in phenotypic expression in the family. We suggest that whole-genome sequencing in large families will improve the understanding of the combined effects of the rare and common sequence variation underlying psychiatric phenotypes

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Edinburgh Research Explorer

Deciphering the genome structure and paleohistory of _Theobroma cacao_

Author: Ang&#xe9
Ange Marie Risterucci
Anne Dievart
Aur&#xe9
Bertrand Pitollat
Christopher Viot
Claire Lanaud
Cristian Chaparro
Dave Kudrna
Didier Clement
Diogenes Infante
Dominique Brunel
Emmanuel Guiderdoni
Erika Sallet
Florent Murat
Francis Quetier
Francois Sabot
Gaetan Droc
Ismael Kebe
Jean Marc Aury
Jerome Gouzy
Jerome Salse
Jetty Siva S. Ammiraju
John E. Carlson
Jose Fernandes Barbosa-Neto
Joseph Moroh Akaza
Julie Poulain
Karina Gramacho
Laura Gelley
Maguy Rodier-Goud
Manuel Ruiz
Mark Guiltinan
Mathias Tahi
Mathilde Allegre
Melissa Kramer
Michael Abrouk
Michael Axtell
Michel Boccara
Mickael Bourge
Olivier Fouet
Olivier Panaud
Patrick Wincker
Pierre Costet
Rod Wing
Ronan Rivalan
Schiex T.
Siela Maximova
Spencer Brown
Stephan C. Schuster
Stephanie Sidibe-Bocs
Thierry Legavre
Valentin Guignon
W. Richard McCombie
Wolfgang Golser
Xavier Argout
Xavier Sabau
Xiang Song
Yolande Roguet
Yufan Zhang
Zhaorong Ma
Zi Sh
Publication venue
Publication date: 15/09/2010
Field of study

We sequenced and assembled the genome of _Theobroma cacao_, an economically important tropical fruit tree crop that is the source of chocolate. The assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of them anchored on the 10 _T. cacao_ chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example flavonoid-related genes. It also provides a major source of candidate genes for _T. cacao_ disease resistance and quality improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten _T. cacao_ chromosomes were shaped from an ancestor through eleven chromosome fusions. The _T. cacao_ genome can be considered as a simple living relic of higher plant evolution

Crossref

Nature Precedings

A genome triplication associated with early diversification of the core eudicots

Author: Ayyampalayam Saravanaraj
Bowers John E
Carpenter Eric J
Chanderbali Andre S
dePamphilis Claude W
Deyholos Michael K
Jiao Yuannian
Kutchan Toni M
Leebens-Mack Jim
McCombie Richard
McKain Michael R
McNeal Joel
Pires J Chris
Rolf Megan
Ruzicka Daniel R
Soltis Douglas E
Soltis Pamela S
Stevenson Dennis W
Wafula Eric
Wang Jun
Wickett Norman J
Wong Gane Ka-Shu
Wu Xiaolei
Zhang Yeting
Zhang Yong
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Background: Although it is agreed that a major polyploidy event, gamma, occurred within the eudicots, the phylogenetic placement of the event remains unclear. Results: To determine when this polyploidization occurred relative to speciation events in angiosperm history, we employed a phylogenomic approach to investigate the timing of gene set duplications located on syntenic gamma blocks. We populated 769 putative gene families with large sets of homologs obtained from public transcriptomes of basal angiosperms, magnoliids, asterids, and more than 91.8 gigabases of new next-generation transcriptome sequences of non-grass monocots and basal eudicots. The overwhelming majority (95%) of well-resolved gamma duplications was placed before the separation of rosids and asterids and after the split of monocots and eudicots, providing strong evidence that the gamma polyploidy event occurred early in eudicot evolution. Further, the majority of gene duplications was placed after the divergence of the Ranunculales and core eudicots, indicating that the gamma appears to be restricted to core eudicots. Molecular dating estimates indicate that the duplication events were intensely concentrated around 117 million years ago. Conclusions: The rapid radiation of core eudicot lineages that gave rise to nearly 75% of angiosperm species appears to have occurred coincidentally or shortly following the gamma triplication event. Reconciliation of gene trees with a species phylogeny can elucidate the timing of major events in genome evolution, even when genome sequences are only available for a subset of species represented in the gene trees. Comprehensive transcriptome datasets are valuable complements to genome sequences for high-resolution phylogenomic analysis

Crossref

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

Copenhagen University Research Information System

Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

Author: Antonio Baltazar A.
Aono Hideo
Apweiler Rolf
Barrero Roberto A.
Bruskiewich Richard
Bureau Thomas
Burr Benjamin
Burr Frances
Costa de Oliveira Antonio
Fujii Yasuyuki
Fuks Galina
Gojobori Takashi
Habara Takuya
Haberer Georg
Han Bin
Harada Erimi
Higo Kenichi
Hilton Phillip B.
Hiraki Aiko T.
Hirochika Hirohiko
Hoen Douglas
Hokari Hiroki
Hosokawa Satomi
Hsing Yue
Ikawa Hiroshi
Ikeo Kazuho
Imanishi Tadashi
Ito Yukiyo
Itoh Takeshi
Jaiswal Pankaj
Kanno Masako
Kawahara Yosihiro
Kawamura Toshiyuki
Kawashima Hiroaki
Khurana Jitendra P.
Kikuchi Shoshi
Komatsu Setsuko
Koyanagi Kanako O.
Kubooka Hiromi
Liberherr Damien
Lin Yao-Cheng
Lonsdale David
Matsumoto Takashi
Matsuya Akihiro
McCombie W. Richard
Messing Joachim
Miyao Akio
Mulder Nicola
Nagamura Yoshiaki
Nam Jongmin
Namiki Nobukazu
Numa Hisataka
Nurimoto Shin
O'Donovan Claire
Ohyanagi Hajimi
Okido Toshihisa
OOta Satoshi
Osato Naoki
Palmer Lance E.
Quetier Francis
Raghuvanshi Surabh
Saichi Naomi
Sakai Hiroaki
Sakai Yasumichi
Sakata Katsumi
Sakurai Tetsuya
Saski Takuji
Sato Fumihiko
Sato Yoshiharu
Schoof Heiko
Seki Motoaki
Shibata Katsumi
Shibata Michie
Shimizu Yuji
Shinozaki Kazuo
Shinso Yuji
Singh Nagendra K.
Smith-White Brian
Takeda Jun-ichi
Tanaka Tsuyoshi
Tanino Motohiko
Tatusova Tatiana
Thongjuea Supat
Todokoro Fusano
Tsugane Mika
Tyagi Akhilesh K.
Vanavichit Apichart
Wang Aihui
Wing Rod A.
Yamaguchi Kaori
Yamamoto Mayu
Yamamoto Naoyuki
Yamasaki Chisato
Yu Yeisoo
Zhang Hao
Zhao Qiang
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2007
Field of study

We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ~32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene

Crossref

PubMed Central

Queensland University of Technology ePrints Archive

Caltech Authors

University of Queensland eSpace